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Abstract 

In dynamical systems such as cellular automata and iterated maps, it 
is often useful to look at a language or set of symbol sequences produced 
by the system. There are well-established classification schemes, such as 
the Chomsky hierarchy, with which we can measure the complexity of 
these sets of sequences, and thus the complexity of the systems which 
produce them. 

In this paper, we look at the first few levels of a hierarchy of complexity 
for two-or-more-dimensional patterns. We show that several definitions 
of "regular language" or "local rule" that are equivalent in d — 1 lead to 
distinct classes in d > 2. We explore the closure properties and compu- 
tational complexity of these classes, including undecidability and L, NL 
and NP-completeness results. 

We apply these classes to cellular automata, in particular to their sets 
of fixed and periodic points, finite-time images, and limit sets. We show 
that it is undecidable whether a CA in d > 2 has a periodic point of 
a given period, and that certain "local lattice languages" are not finite- 
time images or limit sets of any CA. We also show that the entropy of 
a d-dimensional CA's finite-time image cannot decrease faster than t~ d 
unless it maps every initial condition to a single homogeneous state. 
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1 Introduction 



1.1 One-dimensional languages in physics 

Consider a dynamical system, for instance an iterated map F acting on some 
space U. If we partition U into k subsets U\, U2, ■ ■ ■ Uf., then for any initial 
point x we can write down a sequence (a*) of symbols describing which subset 
it falls into at each time-step; i.e. a t = j if F (x) is in Uj. This sequence then 
describes a coarse history or itinerary of x. If the map is invertible, a can also 
be extended backwards, producing a bi-infinite sequence. 

A logical question to ask, then, is: what possible sequences can the system 
produce? In other words, for what sequences is there an x with that sequence as 
its itinerary? This set of sequences is called the symbolic dynamics of the map 
F, and can be a very useful way to classify the system; often the partition can 
be chosen so that the map between points and sequences is one-to-one, allowing 
us to enumerate its periodic points |2Q| and calculate quantities like entropies, 
escape rates and Liapunov exponents |lj] . 

As another example, consider a cellular automaton (CA) in one dimension. 
This is a dynamical system on sequences where each site is updated according 
to some local rule, as a function of its state and those of its neighbors; for 
instance, suppose the state at each site is or 1, and F(a)i = /(aj_i, dj, ai+i) 
for some Boolean function /. Then we can ask a variety of questions, such 
as: what sequences (dj) are in the image of F after one iteration? After two? 
What sequences are fixed points, i.e. F(a) — a? What sequences are periodic 
points, in that F t (a) — a for some tl What sequences map onto the zero state, 
F(a) = (0)? And what points arc in the limit set, the intersection of the images 
of F l for all t > 0? 

All these questions refer, as does the symbolic dynamics question above, 
to sets of sequences, or languages. Clearly some languages are more complex 
than others; the set of sequences {10 p l | p prime} of two l's separated by a 
prime number of 0's, for instance, is clearly more complex than the set of words 
containing an equal number of 0's and l's, which in turn is more complex than 
the set of sequences where two l's never occur consecutively. This qualitative 
notion of complexity is formalized by the Chomsky hierarchy (e.g. |23|]), in which 
languages are classified by the different types of machines needed to recognize 
or generate them. Originally proposed by Noam Chomsky as a set of models of 
natural language, this hierarchy has since been taken up by computer scientists 
and others seeking to quantify the notion of complexity. 

The basic Chomsky classes are called, from simplest to most complex, regu- 
lar, context-free, context-sensitive, and unrestricted; these correspond to increas- 
ingly powerful kinds of machines, at the top of which sits the Turing machine 
(e.g. f4l|| ) which, according to the Church- Turing thesis, is computationally uni- 
versal. 

In fact, examples all up and down this hierarchy can be found in dynamical 
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systems theory. The languages generated by many simple hyperbolic systems are 
regular; this corresponds to the existence of a finite Markov partition for their 
dynamics [ p0[ . At phase transitions such as the period-doubling fixed point, 
however, they can have a complicated scale-invariant structure, and belong to 
an intermediate class called indexed context-free || ; and the iteration of smooth 
maps in the plane can correspond to universal Turing machines [ [f3| . 

Similarly, the image of a cellular automaton after a finite number of time- 
steps is regular |]50|] , as is the set of fixed points; but limit sets can be context- 
free, context-sensitive, or the complement of the halting set of a Turing machine 

s 

The purpose of this paper is to introduce the reader to an analogous hierarchy 
of two-dimensional "languages" or patterns of symbols. This hierarchy turns out 
to be much richer than in one dimension, in that several equivalent definitions of 
regular languages generalize in subtle ways to become distinct classes in d > 2. 
We hope that such a hierarchy will allow us to more clearly discuss issues of 
complexity in spin systems, cellular automata, coupled map lattices, and other 
systems in two or more dimensions. 

To provide background, we review the definitions of regular and context-free 
languages in d — 1. Readers interested more in this subject should consult [^3| 
or another text on the theory of languages and automata. 

1.2 Equivalent descriptions of regular languages 

The recognition machine is a paradigmatic object in language theory. It is fed a 
word as input, and accepts or rejects it according to whether or not that word 
is in the language. The simplest kind of machine is the deterministic finite-state 
automaton (DFA): it consists of a box with an internal state in some finite set 
S, which reads a tape on which a candidate word is written. 

Suppose the language is written in some set of symbols or alphabet A. Then 
the DFA reads the tape from left to right, letting its state at each step depend 
its old state and the symbol it is currently reading according to a transition 
function F : A x S — > S. After it reaches the end of the tape, it accepts the 
word if its final state is in some subset 5*accept of S, and rejects it otherwise. 

For example, consider the language L-^ consisting of words of the alphabet 
{a, 6}, where the only rule is that no two &'s may occur consecutively. This 
language is accepted by a DFA with three internal states, A, B, and R for 
'Reject', with F described by 
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R 


R 



Then we start in state A, accept if we end up in A or B, and reject if we end 
up in R. 
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Wc define a language as regular if it is recognized by some DFA. This em- 
bodies the idea of a language where only a finite amount of memory is required 
to recognize it. 

There are several ways to generalize DFA's in an effort to make them more 
powerful: for instance, we can consider non- deterministic finite-state automata 
or NFA's. Their dynamics consists of a function F : Ax S ^ p{S) whose values 
are subsets of S, giving the machine one or more choices at each step of which 
state to adopt. We say that an NFA accepts a word if there exists some set of 
choices which leads it to an accepting state. 

Non-deterministic machines are in general more powerful than deterministic 
ones, since their definition of acceptance allows them to test many possibilities 
simultaneously: for instance, it is believed that many problems can be solved in 
polynomial time non-deterministically but not deterministically (i.c, P 7^ NP). 

However, in the case of finite-state automata, the NFA is no more powerful 
than the DFA. Create a DFA whose states are subsets of S, S' = p(S). Then 
let F'(a, s') = U ses 'F(a, s), and define a state s' S S' as accepting if it contains 
some accepting state, i.e. Sacccpt = W e p(S) I s ' n ^accept 7^ 0}- Clearly this 
DFA will accept the word if and only if the NFA has an accepting trajectory. So 
NFA's can be simulated by DFA's, and can recognize the same class of languages 
(although the equivalent DFA might be exponentially larger). 

Another seemingly more powerful machine we could consider is a two-way 
finite-state automaton (2DFA or 2NFA) which can move both left and right on 
its input tape — surely an advantage, since it can go back and recall previous 
characters of the input. But in fact this is no more powerful than the one-way 
kind: construct an automaton with states representing a record of all the times 
and states in which the 2-way FA visited a given place on the tape. These 
crossing sequences are finite, since a 2FA with n internal states can visit each 
site no more than n times without falling into a loop. A local matching rule, 
enforceable by an NFA, then ensures that crossing sequences at adjacent sites 
are consistent; details are given in 0]. 

So in one dimension we can say that 

DFA = NFA = 2DFA = 2NFA 

since all these machines recognize regular languages. 

The class of regular languages is also preserved under a variety of operations. 
Let a homomorphism h map the alphabet A onto some smaller alphabet h(A), 
collapsing some symbols and losing information in the process; mapping both 
a and b onto a, for instance. If we do this to a regular language, the resulting 
language is again regular, since an NFA can guess which of the original symbols 
in A it should use. 

Regular languages have a number of other descriptions, including: 
1.) Transition graphs. If we label nodes with states and edges with tape 
symbols, DFA's and NFA's can be written as directed graphs. Such a graph 
has a transition matrix M a for each symbol a, and the number N(l) of allowed 
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words of length I has the leading behavior A where A is the largest eigenvalue 
of their sum M — J2 a M «- 

2. ) Regular expressions. If we read '+' as 'or', multiplication as concate- 
nation, w* as '0 or more repetitions of w\ and e as the empty string, regular 
languages are those which can be expressed with a finite formula using these 
operators. For instance, L^F can be written (a + ba)*(e + b) or (e + b)(aa*b)*a*. 

3. ) Regular grammars. A grammar |2j| consists of a set of symbols V, 
including a start symbol 5* and a set P of production rules a — > /?, where a 
and (3 are strings of symbols. The language generated by the grammar consists 
of the strings that can be derived from S by applying the production rules in 
arbitrary order. 

Regular grammars only have productions of the form A — > wB and A — -> w, 
where A and B are symbols and w is a string of terminal symbols that cannot 
change or create more symbols. Thus the variables move to the right, leaving a 
string of terminals behind them. For instance, the grammar 

S — > aS or bA or e 
A — > aS or e 

generates the strings of Lj^. 

4. ) Positive pumping lemmas. A useful property of regular languages is 
the pumping lemma, which states that any sufficiently long string x in a regular 
language L can be written as x = yzw where yz n w € L for all n > 0. This can 
often be used to show that a language is non-regular. There are positive versions 
of the pumping lemma that are both necessary and sufficient for a language to 
be regular [BTj. 

5. ) Rational formal power series. Consider the formal sum 

e + a + b + aa + ab + ba + aaa + aab + aba + baa + bab + aaaa + . . . 

of all the words in L-^. This sum can be viewed as the expansion of the rational 
function 

i — L ^r( 1 + 6 ) 

1 — a — ba 

where a and b are non-commuting variables and 1 = e. The power series of 
rational functions in non-commuting variables correspond exactly to the regular 
languages; there is a beautiful theory of such series outlined in [ p2[ . 

6. ) Equivalence classes. In a language L, we can define two words as 
equivalent, u ~ v, if they can be followed by the same set of suffixes, i.e. ux S L 
if and only if vx e L. Then a language is regular if and only if ~ has only a finite 
number of equivalence classes, which correspond to the states of the smallest 
DFA that recognizes it. Thus the smallest DFA is unique up to isomorphism. 
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1.3 Finite complement languages 

The language L-^ is an example of a finite complement (f.c.) language, in that 
it can be defined by a finite list of forbidden substrings, namely {bb}. Thus the 
language can be described in a purely local way; equivalently, we could list the 
allowed blocks {aa, ab, ba} of length 2. Sets of infinite sequences defined in this 
way are called subshifts of finite type (e.g. p^|). 

Clearly any finite complement language is regular, but the reverse is not 
the case: for instance, (a*ba*c)* is regular but not f.c, since an infinite set of 
substrings ba*b and ca*c would have to be excluded. Whether the last non-o 
was a b or a c is a 'hidden state', obscured by arbitrarily large blocks of a's. 

However, every regular language is a homomorphism of some f.c. language. 
If we label the edges of the transition graph with distinct symbols, we get 
(a*bd*c)* , which is f.c; its allowed blocks are aa, ab, bd, dd, dc and ca. We can 
ensure that we start and end with the proper states by allowing only jja and cjj, 
where (j is a putative blank character which lies outside the word. By mapping 
d to a, we recover the original set. (In symbolic dynamics, homomorphisms of 
subshifts of finite type are called sofic systems pSfl .) 

In any case, homomorphisms of f.c languages, which we will call h(LLL)'s 
below, are yet another way to define regular languages in one dimension. 

1.4 Context-free languages 

In several places, we will use context-free languages, the second lowest level in 
the Chomsky hierarchy p3| ; they properly contain the regular languages. A 
language is context-free if it is recognized by a push- down automaton (PDA), 
a finite-state machine with access to a stack memory. On reading an input 
symbol, it can read (and pop) the top symbol of the stack, update its internal 
state, and/or push new symbols onto the stack. It accepts if it starts and ends 
with an empty stack. 

The canonical context-free language is the Dyck language{e, (), (()), ()(), (())(), . . .} 
of well-formed words of parentheses; another example is the set {a n b n } of words 
consisting of a block of a's followed by an equal number of 6's. Both of these 
languages are context-free but not regular. 

2 Two-dimensional languages 

How do various definitions of regular language generalize in two or more dimen- 
sions? We will show that DFA's, NFA's and homomorphisms of finite comple- 
ment languages, which were all equivalent to regular languages in one dimension, 
become distinct classes of increasing subtlety, Even finite complement languages 
(which we call Local Lattice Languages, or LLL's) are capable of structure much 
more subtle than in the one-dimensional case. 
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In essence, each of these classes represents a different concept of locality in a 
system's structure. Apparently the distinction between local and global in two 
or more dimensions is actually quite tricky and different attempts to capture it 
lead to very different sets of languages. 

2.1 Notation 

If A is a finite alphabet, let £ = A ZxZl be the set of infinite two-dimensional 
pictures or arrays of symbols in A, and let S m ,n = A mxn be the set of m X n 
blocks. In analogy with one-dimensional languages, we will usually construct 
languages of finite blocks, L C U m .„S] min ; however, we are also interested in 
sets of infinite configurations, L M C E. If these are closed and translationally 
invariant, they are called subshifts as in the one-dimensional case (e.g. p2|). 

To translate back and forth between finite and infinite blocks, we introduce 
extension and restriction operators E and R. If L and are languages of 
finite and infinite configurations respectively, let 

Eoo(L) = {BeZ\VbcB:beL} 

| 3B S Loo : b C B} 

where by b C B we mean that b is a sub-block of B. Then is the set 

of infinite configurations containing only blocks in L, and i?(Loo) is the set of 
finite blocks appearing in infinite configuations in L^. To extend finite blocks 
to larger but still finite blocks, we define 

E(L) ={Be U m , n S ro , n \VbcB:beL} 

which is the set of finite blocks containing blocks in L. 

Note that E and R are by no means inverses of each other! The set of 
infinitely extensible blocks R{E oc {L)) is typically a proper subset of L, and is 
often of a higher order of complexity than L. In fact, even its non-emptiness is 
undecidable, as we will see below. 

We will often be interested in the number of allowed blocks of a certain 
size; we will call the number ofrnxw rectangles in L the growth function of L, 
N(m, n). The entropy per site is then 

log N(m,n) 
a = hm 

m — >oo,n — >oo mn 

if this limit exists. (Since we are simply counting states without a notion of 
measure, this is often called the topological entropy.) 

Finally, we need to say something about boundary conditions. We imagine 
our finite blocks as surrounded by a special blank symbol jj. By interacting 
with these jj's, our various recognition schemes can detect the block's edges and 
corners. 
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2.2 Local Lattice Languages, or LLL's 

Suppose the only allowed 2x2 blocks in a 2-d language of ^'s and 4k 's are 
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and their rotations. Alternately, we could say that the block #| and its rota- 
tions are excluded. Then the only allowed configurations consist of rectangular 
blocks of 4|k's floating in a sea of 'sP's, such as 
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We call this language L r ect, and discuss it further in Section 3. In general, we 
say 

Definition. A two-dimensional language L is a local lattice language (LLL) 
if there exists a finite set of blocks £ a iiowed such that L = -E(£ a iiowed)- 

In other words, L can be defined by listing a finite number of blocks of a 
given size or shape, and demanding that pictures in L contain those blocks and 
no others. The diameter of the largest block in L a iiowed is called the range of L. 
Clearly, LLL's are analogous to the finite complement languages defined above. 
We can give several physical examples: 

1. ) Defect-free ground states of lattice Hamiltonians with local 
interactions. If configurations exist where the local Hamiltonian is minimized 
everywhere, then the neighborhood(s) which minimize it form the allowed blocks 
of an LLL. Conversely, every LLL is the ground state of some local lattice 
Hamiltonian, which assigns a zero energy to allowed blocks and a positive energy 
to others. 

Even if the ground states are frustrated, in that they do not locally minimize 
the Hamiltonian, they can often (but not always ]3q| ) be represented by an LLL 
of larger range. For instance, the set of ground states of the antiferromagnet on 
the triangular lattice is an LLL where the allowed triangles have two |'s and 
one I, or vice versa; this defines a 3-point Hamiltonian with the same ground 
states which is locally minimizable. 

2. ) Space-time histories of 1-d cellular automata or Turing ma- 
chines. If / is a one-dimensional nearest-neighbor CA rule a[ = /(aj_i, di, dj+i), 

then if we allow blocks of the form [ 
evolution from row to row 



1 a 


b 


» 1 




f(a,b,c) 





'lUllUIl 1IU111 1UW LU IUW. I I 

In particular, the CA rule can simulate 



]the LLL with simulate the CA's 
a Turing machine, where special 
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states correspond to the machine's head and internal states, while others corre- 
spond to its tape symbols J35|. Then its input appears along the top row, and 
we can use the LLL to require that it halts or doesn't halt before it reaches 
the bottom. Thus simple questions about 2-d LLL's can be equivalent to the 
Halting Problem; this will be our main source of undecidability. 

3.) Fixed and periodic points in 2-d cellular automata. We will 
show this in Section 3. 

Just as many statistical mechanics models can be exactly solved in one di- 
mension but not in two, 2-d LLL's are often much more subtle than their one- 



dimensional counterparts. Consider for example the LLL where 



1 1 



and 



are excluded; a lattice gas where no two adjacent sites may be occupied. In one 
dimension this is just Lrr again, and the entropy per site is a — logr where 
r = (vo+ l)/2 is the golden mean. However, although a has been calculated 
to high accuracy in the two-dimensional case jlO|, O, it is not known exactly; 
one the hexagonal lattice, on the other hand, an analytic solution exists [||. 

As an example of a zero-entropy LLL, consider the rule where every 2x2 
block must contain an even number of up spins — for instance, the ground state 
of a 4-point interaction, H = — A'^ D 01020304 where the are ±1 and K > 0. 
Allowed configurations consist of the product of horizontal and vertical stripes 
of fs and J,'s of arbitrary width, such as 
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Such a configuration is determined by its topmost row and leftmost column, 
so the number of allowed m x n blocks is N(m,n) = 2' Tl +™~ 1 and the entropy 
per site is zero. 

An LLL can enforce local topological properties. For instance, let our alpha- 
bet be {0, \, \, ,/}. If we exclude blocks where a path branches or ends, i.e. 



rotations, reflections and reversals of 



and 







blocks 



will contain only closed loops or paths that end at the boundary. 

Note that we also allow -^allowed to have jj's in it, in order to detect the 
picture's edges; for instance, we can prevent paths from beginning or ending at 
the boundary by forbidding rotations and reflections of 



and 



This forces all the paths to be closed in the block's interior. 

We can also get scale- invariant behavior in two dimensions, which local rules 
could never provide in one — for instance, suppose that our alphabet is {0, 1}, 

Yx 

and that of every block of 3 sites of the shape \ x x 



either 1 or 3 of the x's 



must be O's so that they sum to zero mod 2. If we also forbid 
upper- left corner is a 1, we get the mod-2 Pascal's Triangle: 
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so that the 
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If we create some choice by allowing the non-O's to be 1 or 2, N(n,n) = 
2 n E 1 s whenever n is a power of 2. Again the entropy per site is zero. In 
general, it is clear that the growth function of an LLL can have a wider variety 
of functional forms than in one dimension. 

One last amusing example — let the allowed blocks be 



where x = or 1, and x — not (a;). Then these blocks generate a counting 
machine, where the last one represents carrying a digit. If we further require 
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at the boundaries, we get n x 2™ rectangles such as 
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Keep in mind that just because an LLL can be described in a local way, 
large configurations are not necessarily easy to construct. As was thought of 
quasicrystals up until a local algorithm was discovered Jiq| , there may be no 
local way to grow large blocks from smaller ones; and attempts to relax large 
blocks from random initial conditions may lead to very slow, glass-like dynamics, 
as in some 2- and 3-dimensional models (e.g. 54 ). The difficulty of growing 
a pattern from an initial seed, or relaxing to a pattern from a random initial 
condition, arc themselves good definitions of complexity, and not necessarily 
correlated with the complexity of recognizing a completed picture. 
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2.3 Deterministic Finite Automata, or DFA's 

The automata in the next two sections were introduced by Blum and Hewitt 
[§. The reader may consult for a review. 

Definition. A 4-way deterministic finite-state automaton, or 4-way DFA, 
consists of a finite set of states S, an initial state So S S, a subset 5 accep t C S, 
and a transition function F : A x S — > S x {f, J,, <— , — >}. F describes how 
the DFA changes its state and moves one step up, down, left or right, as it 
encounters symbols in the alphabet A. We say that a DFA accepts a block if, 
starting in the state so m the upper-left corner, it eventually reaches some state 
in ^accept! we can demand without loss of generality that this happens at the 
lower-right corner. We say a 2-d language is DFA-recognizable or simply DFA 
if there exists a 4-way DFA which recognizes it. 

As for boundary conditions, we have a choice. The DFA can be bounded, in 
that it must always move back into the block if it detects a j}, or unbounded, 
in which it is allowed to move into the jj's. However, it can be shown that an 
unbounded DFA can be simulated by a bounded one |^9); if it has n states, it 
will either return to the block within n steps, or get caught in a loop and wander 
off to infinity. So these two are equivalent. 

We then have 

Proposition. The class of DFA languages properly contains the LLL's. 

Proof. We first show containment. We simply define a DFA which scans 
from left to right and top to bottom, and checks around the neighborhood of 
each site. It checks that each neighborhood is allowed by the LLL, and accepts 
when it arrives in the lower-right corner; if it finds an illegal neighborhood, it 
enters a Reject state and 'hangs.' 

To show the containment is proper, we give an example which is DFA but 
not LLL. Consider the language L sqU are shown in figure 1, of square blocks of 
l's on a background of O's. A DFA can recognize it by scanning the entire 
scene until it hits the upper- left corner of a block of l's; it then checks that it 
is a complete rectangle by scanning its interior, and making sure its edges are 
straight. Finally, it travels diagonally, up and left, from the lower-right corner; 
if it arrives at the upper-left corner, it has verified that it is in a square, and 
moves on to find the next block of l's. 

However, this language is not an LLL. Suppose it were, with range r. Then 
it would be unable to distinguish squares of side greater than r from rectangles, 
since as shown in figure 1 they contain all the same neighborhoods of size r or 
less. I 

As this example shows, a DFA can exploit the geometry of the two-dimensional 
lattice in order to test for structure that in one dimension would be considered 
non-regular (context-free or even context-sensitive). 

As another example, consider the language of p x q rectangles of l's in a sea 
of O's, where p and q are mutually prime; a DFA can test for this by bouncing 
around inside each rectangle like a billiard ball, accepting (and scanning for the 
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Figure 1: A DFA can recognize the language L squaro of square blocks by moving 
diagonally, but an LLL of range r cannot distinguish between rectangles and 
squares whose sides are longer than r, since all the same neighborhoods of size 
r or less appear in both. 
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Figure 2: By bouncing like a billiard ball or making knights' moves, and ending 
one cell from the corner, a DFA can check that the two sides of a rectangle are 
mutually prime, or that the side of a square is a power of 2. 

next rectangle) if it arrives one site to the left of the corner where it started. 
Figure 2 shows this and another example, where a DFA makes knights' moves 
alternately in the directions (2, —1) and (—1, 2) to verify that it is in a square of 
side 2™. These kinds of arithmetic properties would require a context-sensitive 
grammar to recognize in one dimension. 

DFAs can get stuck in loops and run forever. However, we can use an 
argument of Sipser |5q ] to convert any DFA into one which always arrives in the 
lower- right corner (in an accepting or non-accepting state) , and never gets stuck 
in a loop. This works by starting in the lower-right corner in an accepting state, 
doing a depth-first backwards search of the tree of all possible trajectories to see 
if we could have started in the initial state, and using the DFA's own dynamics 
to move back up the tree. As a corollary, the complement of a DFA language is 
also DFA, as we will mention below. 

2.4 Non-deterministic Finite Automata, or NFA's 

The next type of 2-d automaton to consider is the NFA: 

Definition. A 4-way non- deterministic finite-state automaton, or 4-way 
NFA, consists of a finite set of states S, an initial state so € S, a subset 5 acc ept C 
S, and a non-deterministic transition function F : A x S — ► p(S x {|, !,«—,—> 
}). We say that an NFA accepts a block if there exists a set of choices in F 
which leads it from the state sq in the upper-left corner to some state in S^ccept 
(without loss of generality, in the lower-right corner) . We say a 2-d language as 
NFA-recognizable or simply NFA if there exists a 4-way NFA which recognizes 
it. 

Recall that in one dimension, DFA's and NFA's are equivalent. In two or 
more dimensions, NFA's are more powerful: 

Proposition. The class of NFA languages properly contains the DFA lan- 
guages. 

Proof. Containment is obvious; we take an example from |5l[ which is NFA 
but not DFA. Let the alphabet be {0, 1, 2}, and consider squares of non-O's on a 
background of 0's, where the squares have odd side and their center site is a 2. 
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After confirming that it is in a square as we did before, an NFA can recognize 
this by moving diagonally from one corner, non-deterministically turning 90° at 
the 2 in the center, and arriving at another corner. 

A DFA, on the other hand, will get lost; there may be many other 2's floating 
around, and inside a sufficiently large square it has no way of knowing when it 
is in the center. A counting argument to prove this is given in |fil|| . I 

As another example of an NFA language, consider white mazes on a black 
background, with a red square a and a green square b: is there a path from a 
to bl An NFA can non-deterministically guess a path and confirm its existence, 
but for any DFA (or even DPDA) there is a maze it will get lost in and loop 
forever (this is an open question if the DFA can move through walls). The 
"keep your hand on the right-hand wall" method, for instance, will fail if the 
maze has a loop with a outside and b inside [|l] . We will use this as a canonical 
NFA problem to discuss the computational complexity of NFA languages in 
Section 2.8 below. 

2.5 Homomorphisms of LLL's, or h(LLL)'s 

We now come to our most subtle class of 2-d languages. 

Definition. Suppose L is a 2-d language over an alphabet A. Then we say 
L is a homomorphism of a local lattice language or h(LLL) if there is some LLL 
L' in an alphabet A' , and a homomorphism or mapping h : A' — *■ A, such that 
h(L') — L; i.e., V yields L when each symbol a' is replaced by its image h{a'). 

In other words, there is an underlying LLL, some of whose states are hidden 
by the mapping h. For example, consider the LLL whose only allowed 2x2 
blocks are those occuring in 



2 11 

12 1 

112 





Then the 2's down the diagonal enforce the squareness of each island. If we 
apply the mapping h(0) = 0, h(2) = h(l) = 1, we get the language L squa rc of 
the previous section; so this is an h(LLL) which is not an LLL. 

As another example, consider the "Eight Queens" problem, where queens are 
placed on a chessboard in such a way that none of them are attacking each other. 
The reader can easily construct an h(LLL) with symbols {Q, 0} and underlying 
symbols in the power set of {\, / , — n\, ,\} that get hidden by h and 

ensure that none of the Q's are on the same row, column, or diagonal. This 
is not an LLL by the same kind of argument as for L square ; since the queens 
can attack each other over long distances, all the same finite neighborhoods can 
occur for both attacking and non-attacking configurations. 
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In a similar vein, it is shown in |10| that homomorphisms of space-time 
diagrams of 1-d cellular automata are not necessarily given by cellular automata. 

So h(LLL)'s are more powerful than LLL's. How much more? To continue 
the hierarchy, we have 

Proposition. The class of h(LLL)'s properly contains the NFA languages. 

Proof. We first show containment. The idea is to use the same crossing se- 
quences used in the equivalence of 2-way and 1-way automata in one dimension, 
recording what sites the NFA visited and in what states; and using the hidden 
states of the h(LLL) to guess an accepting trajectory for the NFA. 

Let A 1 = A x L, where elements of L are lists of states and directions 
indicating what state the NFA was in on each visit to that site and which way 
it moved from there; for instance, ((53, — ►), (S2, T), ( s 6, *—)) would indicate that 
we visited that site 3 times, the first time in state S3 and then moving right, 
and so on. Since any loop returning to the same site in the same state can be 
pruned from the NFA's trajectory, we need only consider lists where each state 
occurs at most once; so L is finite. 

Clearly, then, the consistency of the trajectory, its directions and successive 
states according to the NFA's transition function, can be represented by a LLL 
on A' . Using corner rules, we can require that we start in the upper-left corner 
in the proper initial state, and end in the lower-right corner in Sacccpt- Finally, 
let h hide L and project A' onto A. So NFA languages are h(LLL)'s. 

We now give an example which is h(LLL) but not NFA. Consider the LLL 
allowing the 2x2 blocks occuring in 
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and let h map A and B to while leaving a and b fixed. This leaves us with 
strings in the 1-d language {a n b n } on a background of O's. 

Since this language is context-free but not regular it cannot be recog- 
nized by a 1-d NFA which moves only in the row containing the string. But 
a 2-d NFA allowed to move into the O's above and below that row is no more 
powerful than a 1-d NFA confined to it, as we will now show. 

Write the NFA's trajectory as a series of pairs (s, d) where s is its state and 
d E {T> Jo ■*—,—>■} hs direction. Then the set of trajectories that move above 
the string's row into the O's and then return to it form a context-free language, 
recognized by a PDA that pushes when it sees an f and pops when it sees a 
|. Each such trajectory returns to the row with a total horizontal displacement 
Ax, equal to the number of — >'s minus the number of <— 's. 

Now the Parikh mapping maps words to vectors, counting the number of 
occurences of each symbol: tt(w) = (#i(u>), # 2 (w), . . . , where is 
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the number of occurences of the z'th symbol in w. If L is a context-free language, 
7r(L) is a semilinear set, i.e. a finite union of sets of the form {p + J2i a i1i I a i = 
0, 1, 2 . . .} where p and q±, . . . , qu are n-dimensional vectors p9[ . 

Since As = jj=^,(w) — and since applying a linear functional to a 

semilinear set yields another semilinear set, the set of possible Ax's is a finite 
union of sets of the form {p + nq \ n = 0, 1, 2 . . .} depending on the initial and 
final state of the NFA. In other words, it is eventually periodic. 

Since the set of words whose lengths form an eventually periodic set is a 
regular language, this means that the NFA's excursions can be simulated by a 
two-way 1-d NFA confined to the string's row, which travels from the 2-d NFA's 
starting point to the point of its return, while keeping track of what state it can 
be in when it gets there. Since a 1-d NFA can only recognize regular languages, 
and since {a n b n } is not regular, this h(LLL) is not NFA. I 

As another example, consider nx2 n rectangles of a single symbol; this is an 
h(LLL) using the counting machine LLL from Section 2.2, but it is shown in p8| 
that it cannot be recognized by an NFA. In jl5| a function is called recognizable 
if the set of n x f(n) rectangles of a single symbol is an h(LLL); they show that 
any function that obeys a linear recurrence relation is recognizable. 

Our lemma regarding an NFA's excursions off a n x 1 strip appears to be 
new; it is an open question whether this can be extended to excursions outside 
an m x n rectangle |5l| . 

We can now summarize our results by saying 

LLL C DFA C NFA C h(LLL) 

with each inclusion proper in d > 2. 

There are many interesting examples of h(LLL)'s; here are some of our 
favorites. 

1.) Two-dimensional L-systems, produced by some expansion rule. 

L-systems |33| are languages generated by applying some dilation rule simultane- 
ously everywhere in the string. For instance, the rules — > 01, 1 — > 10 generate 
the Morse sequence 0110 1001 1001 0110 . . ., and the rules a — > ab,b — > a generate 
the Fibonacci sequence abaab aba abaab . . . that appears in the renormalization 
theory of circle maps. 

In two dimensions, we can expand the lattice by recursively replacing char- 
acters with blocks of a certain size, rather like the expansion rules for perfect 
quasicrystals. The Sierpinski carpet, for instance, is generated by a 3 x 3 ex- 
pansion rule as shown in figure 3. 

To construct an h(LLL) for these, we need an underlying hierarchical struc- 
ture which identifies blocks with their parent blocks on larger and larger scales. 
Robinson Jm], [l8| has constructed a set of tiles which does just that, by running 
H's along the boundaries between blocks and connecting them with the larger 
H's of their parent block, in such a way as to enforce a hierarchical tree. 

By decorating his tiling as shown in figure 4, we can enforce 2x2 expansion 
rules such as a two-dimensional version of the Morse sequence; the homomor- 
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Figure 3: A 3 x 3 L-system that generates the Sierpinski carpet. 



phism removes the H's, leaving us just with the symbols on the leaves. We could 
also make such a rule either locally (with choices made at each site) or globally 
(with the same choice used everywhere) non-deterministic. 

Clearly, we could make a similar construction for m x n expansions, as long 
as m,n > 2. For a n x 1 rule, which is simply a one-dimensional L-system, we 
need some blank space above or below the string's row to parse it; for instance, 
consider the LLL whose allowed 2x2 blocks are those appearing in 
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Then rows of a's have words in a*b* below them, and 6's have a's below them, 
enforcing the rules a — > afe and & — > a and ending with Fibonacci sequences of 
A's and _B's at the bottom. 

2.) Topological examples, such as non-simply connected blobs. 
Just as an LLL can test local topological properties, an h(LLL) can test for the 
existence of a local topological structure. For instance, consider the LLL on the 
alphabet {0,\,/",\,^/}, where we forbid 



\ 



\ 



\ 







and their rotations and reflections. Informally, then, we have a vector field 
which is outward at the boundary, and has no fixed points except saddle points 



such 



and 



If h then maps all the arrows onto 1, we get 



blobs of l's which support a vector field of this form. 
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Figure 4: A two-dimensional L-system analogous to the Morse sequence. An 
h(LLL) can recognize it with a decorated version of Robinson's aperiodic tiles; 
the parent symbols are carried along the stems of the H's, giving rise to the 
daughter symbols where they branch. 

In the continuous case, the existence of such a vector field on a compact set 
U in R 2 would imply that U has at least one hole, since by the Poincare-Hopf 
index theorem |l9| the number of sources, sinks, and circulations minus the 
number of saddles equals U's Euler characteristic \- If we allow only saddle 
points, x < 0, so we have genus 1 or more; if we forbid saddle points as well, 
X = and we have exactly one hole. 

We now show that this is true in the discrete case as well. Start at the 
boundary of a finite blob, and trace the vectors backward; since sources are 
forbidden, there is always at least one predecessor. Since the blob is finite, we 
must eventually find ourselves on a closed curve. 

Suppose the curve is filled entirely with l's so that there are no holes inside it. 
Then starting at a point on the curve and heading inward, along either successors 
or predecessors, we must either come back out or get caught in another cycle, 
as shown in figure 5. But in either case, we have found a cycle smaller than the 
first one. We continue this process until we have a cycle around a single vertex, 
which is forbidden. So by contradiction, the blob contains at least one hole. 

Similarly, one can show that two or more holes imply the existence of a 
'saddle path', a closed curve connecting four points of the form a — > b <— c — > 
d <— a, which contains a smaller saddle path, and so on down to a saddle point. 
So there can be only one hole per blob if saddles are forbidden. 

Conversely, any blob of l's with one or more holes has a vector field without 
sources or sinks; just draw a closed curve around each hole, and then extend 
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y / y \ / 



Figure 5: If sources and sinks are forbidden, start at any point in a cycle and 
move inward (forward or backward along the vector field). The path must re- 
emerge on the cycle, or get caught in a loop; either way, the cycle contains a 
smaller one. 



arrows outward to the boundaries. (We need a certain thickness to do this; 
three cells is usually enough.) 

Thus the language of blobs of l's with at least one (or exactly one) hole is 
an h(LLL). A rather different construction, not based on vector fields, allows 
us to check whether our blobs of l's are simply connected, by checking whether 
the O's are connected (0, p. 191). 

Nakamura has shown that the language of connected blobs of 1 's cannot be 
recognized by a DFA or NFA in three dimensions pj| ; the question is apparently 
still open in d = 2. 

3. ) Non-acyclic graphs. With a suitable alphabet, we can draw out 
arbitrary directed graphs on a lattice, and then let the underlying LLL guess a 
cycle in the graph by coloring a set of edges which doesn't begin, end, or branch. 
Thus the set of directed graphs containing a cycle is an h(LLL). 

4. ) NP-complete problems. Consider an LLL on the alphabet {r, g, b, B, 0} 
in which r, <?, and b represent three colors, a background, and B a boundary be- 



tween two domains. Then forbid rotations of i y , B B , and x B : 
where x, y S {r, g, b} and x ^ y. 

If we then apply the homomorphism h : r, g,b — > x, we get an h(LLL) of 
3-colorable maps, where blobs of x's can be colored red, blue or green such that 
the colors are the same within each blob but differ across a boundary of B's. 
Thus an h(LLL) can determine whether or not a map is 3-colorable. 

The reader may be aware that this problem is NP -complete. Another such 
problem is Boolean Satisfiability, the question of whether a Boolean circuit has a 
set of inputs that makes the output true ]l2| ; the reader may enjoy constructing 
an h(LLL) with symbols for wires and logical gates where the hidden states 
guess the input values. This is similar to why questions such as whether a spin 
glass has a ground state below a certain energy are NP-completc [0] : the hidden 
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states guess the spin configuration, and h leaves just the couplings visible. We 
discuss NP-completeness, and analogous results for DFA's and NFA's, further 
in Section 2.8. 

Giammarresi and Restivo |l3| ] call h(LLL)'s the recognizable languages or 
REC. They show that they are exactly the homomorphisms of languages defin- 
able with a two-dimensional analog of regular expressions, with horizontal and 
vertical versions of concatenation and the * operator |l5| ; they also show that 
h(LLL)'s are exactly the languages definable with existential monadic second- 
order formulas jl4| . Inoue and Nakamura [^6| also defined a class equivalent to 
h(LLL)'s, the non- deterministic on-line tesselation acceptors. 

2.6 Closure properties 

One of the most basic questions we can ask about a class C of languages is 
whether it is closed under various operations: for instance, for two languages 
L\,Ij2 € C, are L\ f| L2, L\ U L2, or L\ also in C? A class with many such 
closure properties is a more elegant algebraic object, and more likely to capture 
a natural set of languages, than one without. 

Closure properties can often simplify a proof in automata theory; for in- 
stance, if L = L\ H L2, it might be easier to show that L\ and L2 are regular 
than to show that L is directly. 

In one dimension, the regular languages are closed under intersection, union 
and complement |2^| ; this generalizes in different ways in d > 2, as we will now 
see. 

Proposition. The class of LLL's is closed under intersection but not under 
union or complement. 

Proof. For intersection, simply forbid any block which either LLL forbids. 

For union, let L\ consist of isolated l's and L2 of isolated 2's, both on a 
background of O's. Then L\ U L2 consists of pictures with either l's or 2's, but 
never both; but an LLL of range r cannot distinguish this set of pictures from 
those with both l's and 2's separated from each other by r or more. So L\ U L2 
is not an LLL. 

For complement, let A = {0, 1} and let L be the single picture with all O's. 
Then L includes pictures with isolated l's; but an LLL that allows l's separated 
by an arbitrary width of O's will also accept L. So L is not an LLL. I 

However, the union of two LLL's with disjoint alphabets Ai and A2 is clearly 
an LLL: just forbid neighborhoods containing elements of both A\ and A2, so 
that each picture consists entirely of one or the other. 

Proposition. The class of DFA languages is closed under intersection, 
union and complement. 

Proof. For intersection, run the first DFA and then the second, returning 
to the upper-left corner after the first one accepts. For complement, use the 
backwards search from the accepting state described above from J5f| . Then the 

union can be written using De Morgan's law, L\ U L2 = L\ n £2- I 
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Proposition. The class of NFA languages is closed under union and inter- 
section. 

Proof. For intersection, run one DFA after the other. For union, non- 
deterministically choose at the outset which one to run. I 

We use the closure of NFA's under intersection to prove a proposition in 
Section 3.3 below. 

Proposition. The class of h(LLL)'s is closed under union and intersection, 
but not under complement. 

Proof. For intersection, suppose the h(LLL)'s have two underlying LLL's, 
L\ and L 2 , and homomorphisms h\ and hi. Then let L' = L\ x L2 be the 
LLL with pairs of states (si, S2) at each site with the two components obeying 
L\ and L 2 respectively, with the additional requirement that h\{s\) = h 2 (s 2 ). 
Then L x n L 2 = h'(L') where h'{(s 1 ,s 2 )) = fti(si) = h 2 (s 2 ). 

For union, assume without loss of generality that the alphabets A\,A 2 of 
the underlying LLL's are disjoint. Then let V be the union of the two LLL's 
on the alphabet A\ U A 2 where neighborhoods containing elements of both A\ 
and A 2 are forbidden, and let h'(s) = hi(s) if s £ Ai for i = 1,2. 

For complement, consider pictures consisting of a single row of 2's, with rows 
of O's and l's above and below it, such that there is a row above the 2's which 
is not equal to any of the rows below the 2's. It is easy to see that this is an 
h(LLL), but it is shown in [^6| that its complement is not. I 

In jl3| it is shown that h(LLL)'s are closed under horizontal and vertical 
versions of concatenation and the * operator; DFA's and NFA's are not |2^, |27j. 

It is an open question whether NFA's are closed under complement. It seems 
unlikely, since (for instance) the set of mazes with no route from a to & would be 
NFA. The basic problem is that NFA's are defined with an existential quantifier, 
3 ("there exists") an accepting trajectory; while the complement of such a set 
is defined with a universal quantifier, V. However, we will see below that NFA's 
could be closed under complement within standard beliefs about complexity 
classes. 

2.7 Extensibility of finite blocks 

In many cases, we are interested in the set of blocks of an LLL that can be 
extended to cover the plane; if we are studying a statistical mechanics system 
on an infinite lattice, for instance, the only finite blocks that are physically 
relevant are those that appear in infinite configurations. 

The set R^E^L)) of finite blocks that are infinitely extensible, i.e. that 
appear in an infinite allowed configuration of a LLL or h(LLL), can be a proper 
subset of L. It can be more or less complex than i; for that matter, it can be 
empty. 

For instance, consider the LLL where horizontal, vertical and diagonal lines 
extend across a blank sea, without being allowed to bend, cross or branch: 
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Figure 6: An finite block of the LLL described in the text, which cannot be 
extended because the two lines intersect. 



allowed blocks being of the form 
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, and 





with blocks such as 



and 



forbidden. A finite allowed block can contain both diagonal 
and (say) vertical lines; but such a block is not extensible, since those lines 
intersect outside the block's boundary as in figure 6. The set of extensible blocks 
consists of those whose lines all have the same orientation; this is recognizable 
by a DFA. 

As another example, consider the h(LLL)'s given above that recognize words 
in non-regular languages; extensible n x 1 strips of the underlying LLL's are 
not DFA, NFA or h(LLL) since all of these can only recognize regular one- 
dimensional languages when confined to a horizontal strip. So the extensible 
subset of an LLL can have a variety of complexities. 

In general, the question of extensibility is undccidablc. Recall that a set is 
recursively enumerable if some Turing machine accepts it by halting when given 
its elements as input, and recursive if both it and its complement are recursively 
enumerable. Then: 

Proposition. The set of infinitely extensible finite blocks of an LLL is the 
complement of a recursively enumerable set, and in two or more dimensions 
is non-recursive in general. Thus it is undecidable whether a finite block is 
infinitely extensible. 

Proof. To show that its complement is recursively enumerable, consider a 
Turing machine which takes a finite block as input and attempts to extend it 
an increasing distance outside its boundary (say, in a spiral around the original 
block) by doing a depth- first search of possible extensions. If it meets a forbidden 
neighborhood, it backtracks and tries the next symbol at the most recent place 
where it had more than one choice; it halts if it has tried all possible states and 
it has no choices left. So, if the block can only be extended m sites around the 
spiral, the machine will halt after at most 0(k m ) computation steps where k 
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is the number of symbols in the LLL's alphabet. If the block is extensible, the 
machine will never halt, so the set of extensible blocks is the complement of the 
TM's halting set. 

To give a non-recursive example, recall that space-time diagrams of one- 
dimensional cellular automata are LLL's in two dimensions. Choose a CA that 
simulates a universal Turing machine (e.g. f35| ) and forbid any neighborhood 
containing the halt state. Then the set of extensible n x 1 rows with the Turing 
machine properly initialized in the upper-left corner are precisely those inputs 
on which the Turing machine will not halt; this is a non-recursive set since the 
Halting Problem is undecidable. I 

The Cluster Variation Method (CVM) in statistical mechanics is a general- 
ization of the mean-field approximation, in which we keep track of the frequency 
of finite blocks up to a certain size and ignore correlations over larger scales. 
To apply it, we need to know when there is a measure on infinite configurations 
that is consistent with a given set of block frequencies. Since only infinitely 
extensible blocks can contribute to such a measure, it is undecidable in d > 2 
whether the CVM is applicable to a given system |33| . 

In one dimension, on the other hand, if L is a regular language then its 
extensible subset is also, since if the finite automaton accepting it has n states, 
extensibility of a finite word depends only on its first and last n symbols. 

The question of whether an LLL in two or more dimensions has any infinite 
allowed configurations at all is also undecidable; this is the Tiling Problem 
[|[ in which we try to cover the plane with a set of interlocking tiles. In one 
dimension, a finite state automaton accepts an infinite word if and only if there 
are loops in its transition graph, which is easily decidable. 

Of course, there are classes of LLL's where every block is extensible, such as 
finite time sets and limit sets of CA's (since by definition these are derived from 
infinite initial states, see Section 3) and 2x2 LLL's where the set of allowed 
blocks is reflection-symmetric, so that a block can be extended as in 

a b c b a 
d e / e d 
a b c b a 



In addition, if periodic configurations are dense in the set of infinite allowed 
configurations, then the extension problem is decidable f57| : since every exten- 
sible finite block is contained in a periodic infinite configuration, as we try to 
extend a block we either run out of choices or reach a periodic block which can 
be repeated, so either outcome is decided in finite time. This includes the case 
where the LLL's allowed configurations form a group 



a b c 
d e / 
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2.8 Acceptance problems and computational complexity 



One way to characterize the power of a class of machines or languages is by 
the computational complexity of its Acceptance problem: given a machine M 
and an m x ?i picture x, does M accept xl Although we have seen that many 
questions regarding infinite pictures are undecidable, we can get some interesting 
results if we restrict ourselves to finite ones, and ask how the computational 
resources needed grow with the size of the picture. 

We recommend |Q to the reader as an introduction to the complexity classes 
we use in the following. 

For LLL's, the problem is easy: acceptance is simply the AND of one predi- 
cate for each neighborhood, which is true if that neighborhood is allowed. This 
can be done in parallel in constant time, if we can AND an arbitrary number 
of things together at once; thus LLL Acceptance is in the class SAC of 
problems solvable by semi-unbounded constant-depth circuits, where one kind 
of gate (in this case AND) is allowed to have an arbitrary number of inputs p7[ . 

For NFA's, one might think that a deterministic machine would have to 
explore an exponential number of trajectories to check for an accepting one. 
However, NFA Acceptance is really just a special case of Graph Reach- 
ability, in which we ask whether there is a path from node a to node b in 
a directed graph: if the NFA has s states, then the nodes of the graph are 
the m x n x s combinations of location and state, and a and b are the initial 
and accepting final states in the upper-left and lower-right corners respectively. 
Conversely, any Graph Reachability problem can be converted to an NFA 
Acceptance problem by drawing out the graph as a maze, and asking the NFA 
to find a path from a to 6 as in Section 2.4 above. 

Graph Reachability is NL- complete, where NL is the class of problems 
solvable by non-deterministic Turing machines with logarithmic space; and since 
these two problems are equivalent, NFA Acceptance is NL-complete too. 
Since NL is contained in the class NC 2 of problems solvable by circuit of depth 
0(log 2 n) for inputs of size n, NFA Acceptance can be solved by a parallel 
computer in C(log 2 mns) time. A serial computer can solve it in 0{mns) time, 
by starting with the initial state and iteratively adding all possible transitions 
to a list of accessible states and sites. 

In the same way, DFA Acceptance is a special case of Reachability 
for directed trees where each node has at most one outgoing edge, and con- 
versely a DFA can explore any such graph drawn on its lattice. This problem 
is complete for the class L of deterministic log-space Turing machines, so DFA 
Acceptance is L-complete. 

Finally, as we saw above, h(LLL) Acceptance is NP-complete since h(LLL)'s 
can guess colorings of graphs or satisfying assignments for Boolean expressions; 
it is in NP since we can guess the hidden states, and easily check (in SAC ) 
whether they satisfy the LLL and match the picture. 

Thus we have proved 
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Proposition. The Acceptance problem is in SAC , L-complete, NL- 
complete, andNP -complete for LLL's, DFA's, NFA's and h(LLL) 's respectively. 

Since by the Immerman-Szelepczenyi theorem [p| the class NL is closed un- 
der complement, NFA's could be closed under complement without any drastic 
consequences to complexity theory. 

3 Applications to Cellular Automata 
3.1 Cellular automata 

Cellular automata or CA's are spatially extended dynamical systems defined 
on a regular lattice where the state at each site consists of a symbol from a 
finite alphabet. Computation theory has been used to characterize the dynam- 
ics of one-dimensional CA's (e.g. H]); in particular one can describe sets of 
configurations such as periodic sets, finite time sets, and limit sets in terms of 
their languages of allowed finite blocks. For example, it was shown by Wolfram 
that any finite time set (set of images allowed at a certain time) of a 1-d CA is 
described by a regular language |Kj ■ 

In this Section, we apply some of the results of Section 2 to the dynamics of 
CA's in two or more dimensions. In particular, we show that the appropriate 
generalizations of regular languages for fixed and periodic points on the one 
hand, and finite time sets on the other, are LLL's and h(LLL)'s respectively. 

Let us first introduce a few definitions, for simplicity given for the case of a 
2-d CA on a square lattice (extensions to higher dimensions and other regular 
lattices are straight-forward). Let the state at each site (x,y) of Z x Z be 
a (x,y) G ^> where A is a finite alphabet with \A\ — k symbols. All lattice 
sites are updated simultaneously using a local transition function / : A B — > A, 
where B is some finite neighborhood surrounding each site. This induces a 
global CA mapping F : A ZxZ — > A Zy ' z . (According to the Curtis-Hedhmd- 
Lyndon theorem J22]| , a map on A ZxIj is a CA if and only if it commutes with 
the horizontal and vertical shift maps, and if it is continuous with respect to 
product topology induced by the the discrete topology on A.) 



One commonly used neighborhood on the square lattice is the 5-site von 
Neumann neighborhood shown above; then the local transition function has the 

form oj+^j = /(0( B| j,}.0( ie+ i, 1 ,)»o|»-i jl ,).a(x,i,+i)« (x 1 v-i))- A CA has radms r 
if its neighborhood is contained in a square of side 2r + 1; the von Neumann 
neighborhood has radius 1. 

Typically, we consider the time evolution of subshifts, i.e. closed (under the 
product topology) translation-invariant sets of infinite configurations. Starting 
with the set A ZxZ of all infinite configurations, only a certain subset of these 
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are allowed after a given number of time-steps; so one basic kind of language 
we can associate with a CA is the image of F*. 

Definition. The finite time set SI* of a 2-d CA is defined by 

SI* = F t (Sl°) 

where Sl° = A Zx1, is the set of all infinite configurations. 

As t — > oo, the asymptotic behavior of a CA is described by its limit set, 
which consists of those configurations in SI* for all t (and conversely, which have 
predecessors arbitrarily far back in time): 

Definition. The limit set of a CA is defined as 

oo 

Sl°° = f] SI* 

t=0 

The finite time sets and limit set clearly obey 

Sl° D SI 1 D SI 2 D . . . D SI 00 

We can also look at the set of periodic points of a CA: 
Definition. The period-p set is given by 

IP = {ce A ZyZ \F p (c)=c} 

The set of all periodic points is the periodic set, H = U^Li n p - Clearly II C Sl°°. 

These definitions work for infinite configurations. Finite time sets, limit 
sets, and periodic sets are all subshifts, so equivalent definitions can be given 
in terms of finite blocks. If we use the local transition function to define a CA 
map F : ^(™+2r) x (n+2r) _^ A mxn on finitc blocks, we have 

R(Sl*) = F*(R(Sl )) 

and 

oo 

R(n°°) = p| r(si*) 

t=0 

where R is the restriction operator defined in Section 2.1. 



3.2 Periodic sets 

Let us first consider the fixed point configurations of a 2-d CA. These form an 
LLL |47): 

Proposition. The fixed point set of a CA with radius r is described by an 
LLL of range 2r + 1. 

Proof. Simply allow those neighborhoods (3 E A B of size 2r + 1 for which 
the center symbol is fixed, i.e. f(f3) = /?(o,o)- B 
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Since the p'th iteration of the CA mapping is itself a CA with radius pr, 
whose fixed points are the period-p states of the original CA, we also have 

Corollary. The period-p set IP of a CA with radius r is described by an 
LLL of range 2pr + 1 . 

In fact, the converse is true as well: 

Proposition. For any LLL and any p, there is a CA for which the LLL is 
its period-p set IP . Therefore, it is undecidable whether an arbitrary 2-d CA 
has a periodic orbit of a particular period p. 

Proof. This is easy in the fixed point case (p = 1): let the CA change 
the value of a site if it belongs to a forbidden block of the LLL, and leave it 
unchanged otherwise. Then only allowed configurations are fixed. 

For p > 1, if the LLL has alphabet A, introduce an extended alphabet A' = 
Ax{0, . . . ,p}. Let the CA rule be given by (a, n) — > (a, (n+1) mod if the 

symbol a belongs to a forbidden block, and (a, n) — ► (a, (n+1) mod p) otherwise. 
All configurations are then periodic; but one consisting only of allowed blocks 
has period p, while all others have period p + 1 or p(p + 1). Thus IP is the 
desired LLL. 

From the undecidability of the extension problem for LLL's (see Section 2.7), 
it follows that it is undecidable whether a CA has an orbit of period p. I 

Since homogenous configurations map among each other, a CA with k states 
always has some periodic orbit of period p < k. So, in contrast to the result 
above, the question of whether a CA has any periodic orbits is trivially decidable. 

As an example, let us consider the fixed points of the additive 2-d CA defined 

by 

<!) = Km) + 4+i j) + a h-i,j) + a h+D + a \i,i-i)) mod 2 



In a fixed point configuration, a*^ 1 ) = dh j\ and 

+ fl (Hi,i) + a \i-ij) + a lj+i) = mod 2 

Then if odd and even sublattices are considered separately, we obtain two in- 
dependent copies of the LLL in Section 2.2 with an even number of up spins in 
each 2x2 block. This means that the entropy per site of the fixed point set is 
zero, since e.g. for anxn diamond (with n sites along diagonal edges and a total 
of n 2 + (n — l) 2 sites) the number of allowed configurations is N(n) — 2 4 "~ 4 . 

A more general statement for periodic sets can be made for a class of CA's 
that form the 2-d analog of the left (right) permutive CA rules studied in e.g. 
[ p2| |30| [34| (the ergodic properties of this class of 2-d rules are studied in j|?J ) : 

Proposition. Consider a 2-d CA with neighborhood B associated with a 
site (0,0). Say a site in B is extremal if it cannot be written as a convex 
combination of other points in B. If the transition function f is an injective 
function of some extremal site (x, y) when all other inputs are held fixed, and if 
(x,y) ^ (0,0), then the entropy ofW is zero for any p. 

Proof. Since (x, y) is extremal, it can be separated from the rest of B by a 
straight line L as shown in figure 7. Injectivity implies that the value of o,< x> y) 
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(0,0) 



(x,y) 



Figure 7: The fixed points of a CA which is injective on an extremal site in its 
neighborhood are determined by strips of constant width on the outside of a 
rectangle. 



is uniquely determined by the other sites in the neighborhood together with 
a (o~o)' for a fixed point, «*Jq) = a *oo)' so a (x,y) 1S determined by the rest of 
the neighborhood. This in turn means that a rectangular block with its top 
edge oriented along L is uniquely determined by strips of sites of width at most 
2r along three of its sides; so the number of m x n configurations grows as 
N(m, n) = \ A\ cm+dn for constants c and d, and the entropy per site is zero. 

The generalization to IP follows from the fact that if / is injective on an 
extremal site (x,y), then f p is injective on the site (j>x,py), which is extremal 
in its larger neighborhood. I 

This class of CA rules includes additive rules with a prime number of states 
(except for those with a one-cell neighborhood). Additive CA's with a composite 
number of states where some periodic sets have positive entropy can easily be 
constructed, even in d = 1: let k — 4, and let a* +1 = {2a t i _ 1 + a' +1 ) mod 4. 
Then any sequence consisting only of symbols and 2 is a fixed point, and all 
other configurations have period 2; so both II 1 and n 2 have positive entropy. 



3.3 Finite time sets 

Examples of CA finite time sets can be found in a number of the language classes 
discussed in Section 2. A very simple example is given by the von Neumann 



neighborhood rule which maps Q7 



a] to 1 and all other neighborhoods to 0. 



This rule reaches a fixed point afrit one time-step, so fi 1 = Q°° — II 1 and are 
all described by the LLL where no adjacent pairs of l's are allowed. 

In general, if the limit set of a CA is described by an LLL, then it is reached at 
finite time. This was stated in |^| and shown for the 1-d case in J25|. The proof 
for arbitrary dimensions is essentially identical to that in one dimension, even 
though this case might seem more subtle because of the distinction between 
the defining LLL and the set of finite blocks that actually appear in infinite 
configurations. 
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Proposition. If the limit set O 00 of a CA is described by an LLL, there 
exists a finite time t such that fi* = ft 00 . 

Proof. If f2°° is described by a finite list A of forbidden blocks, then each 
block b must be excluded at some finite time t(b). If we let t = ma^i, e \t(b), 
then no further blocks are forbidden after t and fi* = ft 00 . I 

Some LLL's are CA finite time sets (and by this Proposition also limit sets), 
such as those where some symbol a does not appear in any of the forbidden 
blocks: let the CA change any symbol to a if it belongs to some forbidden 
block, and leave it unchanged otherwise. Then after one time-step, only allowed 
neighborhoods remain. 

However, not every LLL is a CA finite time set, as we will now show using 
a lemma similar in spirit to the Pumping Lemma for 1-d languages: 

The Patching Lemma. Suppose L = 0* for some CA with radius r. Let 
P 1 , P 2 , . . . ,P k be pictures in L. Let g be a function from Z x Z to {1,2, ... , k}. 
Then if we define a new picture P in patches as P( X: y) = P^ x ^ , there is a 
picture P' in L which coincides with P everywhere where g is constant for rt 
sites in all directions, i. e. P and P' only differ within rt sites of the boundaries 
of g 's domains. 

Proof. (Shorter than the statement of the lemma.) Each P % is F t (Q t ) for 
some initial state Qj. Define Q in patches as Q( x . y ) = Q 9 ^^', then P' = F t (Q) 
is as described. I 

Then we can prove that the language L lcct from Section 2.2 is not a CA 
finite time set or limit set: 

Proposition. There are LLL's which are not CA finite time sets or limit 
sets. 

Proof. Suppose L lec t = fi* for a CA of radius r. Let Pi be a single square of 
l's of side n > Art + 2, let P 2 be all O's, and let P be a patch of the two as shown 
in figure 8, where g = 2 in a square inside Pi of side m with n — 2rt > m > 2rt. 
Then P has at least some O's inside a square of l's, which is clearly not in L rcct ; 
so L rcct violates the Patching Lemma. 

So L roct cannot be a finite time set of any CA, and since limit sets which are 
LLL's are also finite time sets, it cannot be the limit set of any CA either. I 

On the other hand, we have 

Proposition. Any h(LLL) is the intersection of a CA finite time set with 
an LLL. 

Proof. Let the CA's alphabet be the alphabet of the underlying LLL, with 
an additional "error" symbol x. Then let the CA rule map any site belonging 
to a forbidden block of the LLL to x, and map sites whose neighborhoods are 
allowed according the homomorphism h. Then the h(LLL) is the intersection of 
17 1 with the LLL L x that forbids x from appearing. I 

Corollary. There are CA finite time sets that are not NFA 's. 

Proof. Let L be an h(LLL) which is not NFA, such as the one from Section 
2.5 that recognizes strips of {a n b n }. Then L — ft 1 n L x by the previous Propo- 
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Figure 8: Using the Patching Lemma to show that L rect is not a CA finite time 
set. Black and white are l's and O's respectively. 

sition. The NFA's are closed under intersection (Section 2.6), and L% is an LLL 
and so also an NFA, so L would be an NFA if fi 1 were, and it isn't. I 

To complete the relationship between CA finite time sets and the language 
classes we have discussed, we have 

Proposition. The class of CA finite time sets is properly contained in the 
class of h(LLL)'s. 

Proof. First we show that O* is an h(LLL) for any t. Let the CA's neigh- 
borhood be B, and denote the neighborhood of the f'th iteration of the CA 
mapping F f as B l . Introduce a new alphabet A' = A B , whose symbols con- 
sist of neighborhood configurations [3. Then an LLL of range 2rt can ensure 
that the /?'s overlap in a consistent way, and fi* is obtained by applying the 
homomorphism h = F . 

To show the inclusion is proper, take L rec t or any other h(LLL) that is not 
a CA finite time set. E 

One important property of CA finite time sets (and also CA limit sets) is 
that they are, by definition, extendable to infinite configurations since they are 
generated from infinite initial conditions; thus the extension problem is trivially 
decidablc. In addition, we can show that the growth function N(m,n) of a CA 
finite time set must have a comparatively simple form. 

In Section 2.2 we saw LLL's in d = 2 with a wide variety of N, including a 
number with zero entropy. This is in sharp distinction to the one-dimensional 
case, in which the leading behavior of N(n) for regular languages is always 
n k \ n where A is algebraic and k is a non-negative integer, and the entropy is 
a = log A. For CA finite time sets we can recover a weak analogy of this: 

Proposition. In any number of dimensions, a finite time set 0* of a CA 
with radius r either consists of one homogeneous picture, or has a growth func- 
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tion for a volume v bounded by 

N(v) > 2"/( 2rt+1 ) d 

and thus an entropy per site of at least a > (log2)/(2rt + l) d . 

Proof. Use the Patching Lemma; unless L contains only one picture, two 
neighborhoods get mapped to different states by F*, and we can choose to fill 
each block of width 2rt + 1 with one or the other, giving the stated result. I 

Thus the entropy of Q* cannot decrease faster than t~ d unless the CA con- 
verges to a single homogeneous fixed point in finite time. 

3.4 Limit sets 

In |Q , Hurd uses travelling particles to enforce context-free and context-sensitive 
structures in the limit sets of one-dimensional CA's. We can use a similar strat- 
egy to construct limit sets in two dimensions which are DFA, NFA, or h(LLL). 

The CA rule sketched in figure 9, for instance, is designed to allow squares 
of l's in a sea of O's, by extending a string of a's down and to the right from the 
upper-left corner of each rectangle of l's. If it meets another corner, it knows 
the rectangle is a square; it turns the head of the string to a 6, retracts it up and 
to the left, and begins again. If it meets an edge it generates an error symbol x 
(which is also generated if the rules of L roc t are violated) , which propagates at 
the speed of light and destroys the entire lattice. 

This CA's limit set then consists of squares of l's, with strings of a's and 
an optional b at various stages of construction along the diagonal, plus various 
propagating fronts of a:'s. It is easy to show that this is a DFA but not an LLL, 
since we get L squaie if we intersect it with the LLL forbidding a, b, and x. 

Similarly, we can recognize squares of l's and 2's with 2's in the center by 
extending synchronized strings along two diagonals, or strips of the language 
{a n b n } by extending strings from the middle and the ends as shown in figure 
10. Then the corresponding limit sets are NFA and h(LLL) respectively. 

Every CA limit set is the complement of a recursively enumerable set, but is 
not generally recursive, even in one dimension [p4j| . The proof is similar to that 
for extensibility in Section 2.7, except that the Turing machine tries to construct 
preimages, rather than extensions, of blocks to see if they are in i?(f2°°). 

We used the Patching Lemma above to prove that there are LLL's which are 
not limit sets. A logical question, then, is 

Open question: Are there DFA's, NFA's or h(LLL)'s, other than LLL's, 
which we can prove are not CA limit sets? 

4 Generalizations 
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Figure 9: A CA rule on the alphabet {0,1, a, b, x} that allows only squares of 
l's on a background of 0's in its limit set, and its evolution on a 3 x 3 square. 
A blank means "don't care" , and the x-spreading rule takes precedence over all 
others. 
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Figure 10: Enforcing NFA and h(LLL) limit sets with synchronized particles. 
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log I 

Figure 11: By using the method of figure 2 on diagonal slices, a 3-d DFA in a 
cube of side I can confirm that m and / are mutually prime for all m < I, so that 
I is prime. By generalizing the knights' move method, it can calculate log/, and 
then iterate this process to confirm that I = 2 | 3 k for some k. 

4.1 Higher dimensions 

Going to three or more dimensions doesn't change things as much as going from 
one to two did. The proper inclusions LLL C DFA C NFA C h(LLL) still 
hold: rf-dimensional cubes are DFA but not LLL, odd-sided cubes of l's and 
2's with a 2 in the center are NFA but not DFA, and h(LLL)'s that are strips 
or layers of lower-dimensional non-NFA languages surrounded by blanks cannot 
be recognized by NFA's. The results on CA languages in Section 3 also hold. 

However, there are some interesting higher-dimensional examples. Cubes of 
prime side I are 3-d DFA; by taking diagonal slices as shown in figure 11, we 
can use the 2-d DFA of figure 2 to confirm that I and m are mutually prime for 
every m < I. If we have a fourth coordinate, we can add m to it whenever m 
divides I, and check that I is the sum of its factors; so tesseracts of perfect side 
arc 4-d DFA! 

We can also generalize the 2" x 2™ square language discussed above. If we 
define 

2 t„ k = 2 U-i (2 t„-i (• • • (2 Tn-i 2))), 2 to k = 2 + k 

V v ' 

k times 

then 2 fi k = 2k, 2 "f 2 k — 2 k , and so on. Then a DFA in d > 1 dimensions 
can verify that it is in a c?-cube of side I = 2 k for some k. In d = 3, for 
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example, it does this as shown in figure 11 by moving one step in a perpendicular 
direction at the end of each series of knights' moves; at the end of the process 
this coordinate is log 2 I. We can then use this as a starting point and calculate 



log 2 log 2 I, and so on, until we get to 1, showing that I = 2 I3 k = 2 2 for some 
number k of levels of exponentiation. If we define hog / = k, then in d = 4 wc 
can calculate iterated llogarithms, and so on. 
For fun, if we define 



then we state without proof (exercise for the reader) that a DFA in d > 4 can 
check that a cube has side 3!d-3fc for some k. 

The DFA's in this and the previous example can be thought of as finite 
automata with d counters, namely the DFA's coordinates; it increments and 
decrements them by moving, and checks for zero by hitting the side of the cube. 

Topological examples also become more interesting. By checking for the 
existence of a consistent normal, 3-d h(LLL)'s can confirm that a manifold is 
orientable; and perhaps a clever reader can come up with a discrete foliation or 
vector field in R — K which only exists for certain knot or link types K. We 
can also more easily represent non-planar graphs. 

However, some problems may get harder: the "keep your hand on the left- 
hand wall" algorithm for traversing acyclic mazes no longer works for d > 2, and 
some spin systems that are exactly solvable in two dimensions are not known 
to be in three. 

4.2 Higher types of acceptors 

Why not continue up the ladder of the Chomsky hierarchy, to two-dimensional 
versions of push-down automata and Turing machines? Partly because the 
distinction between one and more dimensions is not so great as for regular 
languages. Recall that a two-way push-down automaton (2PDA) is a finite- 
state machine with access to a stack memory which can move left or right on 
its input; a Turing machine (TM) is a finite-state machine which can move left 
or right and write new symbols on its tape; and a bounded Turing machine is 
one which is confined to the part of the tape its input is written on. Bounded 
TM's accept the context-sensitive languages p3| . Then: 

Definition. If w is an m x n picture, let raster(w) be io's rows separated by 
marker symbols, w (M ) . . . iU(i, n )t]iU(2,i) ■ ■ ■ W( 2 ,n)\\ ■ ■ ■ t|W(m,i) ■ ■ • W(m,n)- If 10 is a 
d-dimensional picture, we separate its rows with cZ — 1 different marker symbols 

tlx, - - - ta-i- 

Proposition. If L is a d-dimensional language recognizable by a d-dimensional 



(PDA, TM, bounded TM), then raster(L) is recognizable by a one- dimensional 
(2PDA, TM, bounded TM). 



2 




k times 
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Proof. Wc will prove this for d = 2; the generalization to higher dimensions 
is straightforward. 

We need to show that the one-dimensional version of each of these machines 
can simulate its two-dimensional version, by moving "up" (resp. "down" ) in 
raster(w) from Wij to Wi-i,j (resp. Wi+ij). A PDA can do this as follows: scan 
for the first t| to the left of your current position, pushing a symbol x onto the 
stack at each step; there are now j x's on the stack. Then move to the next \\ 
to your left (right), and then move right, popping an x at each step, until there 
are no x's left. You are now in the j'th site in the row to the left (right) of your 
original position. 

A Turing machine can accomplish the same thing by marking its current 
position with an a and marking the next \\ beyond the one to its left (resp. the 
t] to its right) with a 6, and then shuttling back and forth, moving the a to the 
left and the b to the right, until after j steps the a arrives at the t| to the left of 
its original position. The b is now j sites to the right of the appropriate t|. I 

Surprisingly, Acceptance for deterministic two-way PDA's in one dimen- 
sion is decidable in linear time , so we have 

Corollary. Acceptance for deterministic PDA 's in any number of dimen- 
sions can be decided in time proportional to the volume. 

There are several higher types of recognizers that the reader should be aware 
of if she wishes to further explore this subject, such as: 

Alternating Finite Automata (AFA's). These are a generalization of 
NFA's in which the tree of possible trajectories can have both existential nodes, 
that require at least one of their subtrees to accept, and universal nodes, that 
require all of their subtrees to accept (NFA's are just AFA's with only existential 
nodes). In one dimension AFA's only recognize regular languages, but in d > 2 
they are more powerful than NFA's; the relationship between h(LLL)'s and 
AFA's is an open question (H). The Acceptance problem for AFA's is P- 
complete [ji"71 , suggesting that they lie between NFA's and h(LLL)'s in power. 

Pebbling automata. These are finite-state automata that have a fixed 
supply of pebbles, which they can pick up or deposit on sites of the input, and 
sense when they run across them. In one dimension, one-pebble machines can 
only recognize regular languages, even in the alternating case || in two 
dimensions, the reader can easily show that both the NFA language of squares 
with a 2 in the center and the h(LLL) of strips of {a n b n } can be recognized by 
a one-pebble DFA, and an NFA with one pebble can look for cycles in a graph. 

Multi-head finite automata. These are finite-state automata with mul- 
tiple heads which they can move independently on the input. A two-head DFA, 
for instance, can recognize the language {u^w} of words repeated twice with a 
marker in the middle, which is neither regular nor context-free. If DFA(k) and 
NFA(k) are the classes recognized by /c-head DFA's and NFA's, both of these 
form distinct hierarchies (i.e. k + 1 heads are more powerful than k) in d = 1 
and therefore in higher dimensions as well J42[ . A logical characterization of 
fc-head DFA and NFA languages is given in 
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We could also discuss counter h(LLL) 's where the hidden states contain one 
or more integers, with an underlying local rule that can impose inequalities 
between them and those of their neighbors, increment or decrement them from 
neighbor to neighbor, or check for zero. Since these can recognize non-regular 
languages in one dimension such as the Dyck language, they properly contain the 
h(LLL)'s. A counter h(LLL) could also check that a vector field is a gradient, or 
that a directed graph is acyclic, by assigning an altitude s at every site such that 
Si > Sj if there is an edge i — ► j. We believe, but have not been able to prove, 
that ordinary h(LLL)'s cannot recognize this language; they can recognize its 
complement, however, as we showed in Section 2.5. 

5 Conclusion 

We have shown that the notion of "regular language" generalizes in several 
different ways in two or more dimensions: LLL's, DFA's, NFA's and h(LLL)'s. 
The examples given hopefully give the reader an intuition for what each class 
is capable of. 

As tools and applications, we have studied the closure properties of these 
classes, related their Acceptance problems to the complexity classes SAC , L, 
NL and NP, and applied them to the languages generated by cellular automata 
in finite and infinite time. 

We hope that we have given the reader some conceptual tools for the classi- 
fication of two-or-more-dimensional patterns found in her research; or at least 
that she has found the examples and distinctions we have made enjoyable. 

A preliminary version of this work appeared in ]36| ] . 

Acknowledgements. We are grateful to the Niels Bohr Institute where 
this work was begun, the Santa Fe Institute where it was continued, and the 
Bellairs Research Institute of McGill University where it was finally put to rest. 
We also thank Jean-Camille Birget, Marek Chrobak, Tao Jiang, Oliver Matz, 
and David Simplot for helpful communications. This work was supported in 
part by NSF grant ASC-9503162. 

References 

[1] R. Artuso, E. Aurell and P. Cvitanovic, "Recycling strange sets." Nonlin- 
earity 3 (1990) 325-. 

[2] F. Barahona, J. Phys. A: Math. Gen. 15 (1982) 3241. 

[3] Y. Bargury and J. Makowsky, "The Expressive power of Transitive Closure 
and 2-way Multihead Automata." Lecture Notes in Computer Science 626 
1-14. Springer- Verlag, 1992. 



3G 



[4] R.J. Baxter, Exactly Solved Models in Statistical Mechanics. Academic 
Press, London, 1982. 

[5] R. Berger, "The undecidability of the domino problem." Memoirs Amer. 
Math. Soc. 66 (1966) 1-72. 

[6] M. Blum and C. Hewitt, "Automata on a 2-dimensional tape." 8th IEEE 
Symp. on Switching and Automata Theory (1967) 155-160. 

[7] S.A. Cook, "Linear time simulation of deterministic two-way pushdown 
automata." Proc. 1971 IFIP Congress 75-80. 

[8] W. Coy, "Automata in Labyrinths." Lecture Notes in Computer Science 56 
65-71. Springer- Verlag, 1977. 

[9] J. P. Crutchfield and K. Young, "Computation at the onset of chaos." In 
Complexity, Entropy, and the Physics of Information, W.H. Zurek, Ed. 
Addison- Wesley, 1990. 

[10] J. P. Crutchfield, "Unreconstructible at any radius." Physics Letters A 171 
(1992) 52-60. 

[11] S. Finch, "Hard square entropy constant." 

http://www. mathsoft.com/asolve/consta nt/square/square.htm( 

[12] M.R. Carey and D.S. Johnson, Computers and Intractability: A Guide to 
the Theory of ' NF '-Completeness. W.H. Freeman, San Francisco, 1979. 

[13] D. Giammarresi and A. Restivo, "Recognizable picture languages." Int. J. 
of Pattern Recognition and Artificial Intelligence 6(2-3) (1992) 241-256. 

[14] D. Giammarresi, A. Restivo, S. Seibert and W. Thomas. "Monadic second 
order logic over rectangular pictures and recognizability by tiling systems." 
Information and Computation 125(1) (1996) 32-45. 

[15] D. Giammarresi, A. Restivo. "Two-dimensional languages". To appear in 
Handbook of Formal Languages, G. Rosenberg, A. Salomaa Eds. Springer 
Verlag, 1996. 

[16] P. Goralcik, A. Goralcikova, and V. Koubek, "Alternation with a pebble." 
Information Processing Letters 38(1) (1991) 7-13. 

[17] R. Greenlaw, H.J. Hoover, and W.L. Russo, Limits to Parallel Computa- 
tion: P- Completeness Theory. Oxford University Press, 1995. 

[18] B. Grunbaum and G.C. Shepard, Tilings and Patterns. W.H. Freeman, San 
Francisco, 1987. 

[19] V. Guillemin and A. Pollack, Differential Topology. Prentice-Hall, 1974. 



37 



[20] see for instance Ch. 5 of J. Guckcnheimcr and P. Holmes, Nonlinear Os- 
cillations, Dynamical Systems and Bifurcations of Vector Fields. Springer- 
Verlag, 1983. 

[21] Silas Haslam, A General History of Labyrinths. Vienna, 1888. 

[22] G.A. Hcdlund, "Endomorphisms and automorphisms of the shift dynamical 
system." Mathematical Systems Theory 3 (1969) 320-375. 

[23] J.E. Hopcroft and J.D. Ullman, Introduction to Automata Theory, Lan- 
guages, and Computation. Addison- Wesley, 1979. 

[24] L.P. Hurd, "Formal language characterization of cellular automaton limit 
sets." Complex Systems 1 (1987) 69-80. 

[25] L.P. Hurd, "Recursive cellular automata invariant sets." Complex Systems 
4 (1990) 119-129. 

[26] K. Inoue and A. Nakamura, "Some properties of two-dimensional on-line 
tesselation acceptors." Information Sciences 13 (1977) 95-121. 

[27] K. Inoue, A. Nakamura and I. Takanami, "A note on two-dimensional finite 
automata." Information Processing Letters 7(1) (1978) 48-53. 

[28] K. Inoue and A. Nakamura, "Two-dimensional finite automata and unac- 
ceptable functions." Int. J. Comput. Math. A 7 (1979) 207-213. 

[29] K. Inoue and I. Takanami, "A survey of two-dimensional automata theory." 
Information Sciences 55 (1991) 99-121. 

[30] E. Jen, "Global properties of cellular automata." Journal of Statistical 
Physics 43 (1986) 219 -242. 

[31] B. Kitchens and K. Schmidt, "Periodic points, decidability, and Markov 
subgroups." Lecture Notes in Mathematics 1042 440-454. Springer- Verlag, 
1988. 

[32] D. Lind and B. Marcus, Symbolic Dynamics and Coding. Cambridge Uni- 
versity Press, 1995. 

[33] A. Lindenmayer, "Developmental systems without cellular interaction, 
their languages and grammars." Journal of Theoretical Biology 30 (1971) 
455-484. 

[34] K. Lindgren, "Correlations and random information in cellular automata." 
Complex Systems 1 (1987) 529-543. 

[35] K. Lindgren and M.G. Nordahl, "Universal computation in simple one- 
dimensional cellular automata." Complex Systems 4 (1990) 299-318. 



38 



[36] K. Lindgren, C. Moore, and M.G. Nordahl, "Complexity of two-dimensional 
patterns." J. Unpub. Res. 1 (1990) 1-32. 

[37] A. de Luca and S. Varrichio, "A positive pumping condition for regular 
sets." Bulletin of the ETACS 39 (1989) 171-175. 

[38] J. Mikiesz, "The global minimum of energy is not always a sum of local 
minima — a note on frustration." Journal of Statistical Physics 71 (1993) 
425-434. 

[39] D.L. Milgram, "A region crossing problem for array-bounded automata." 
Information and Control 31 (1976) 147-152. 

[40] S. Milosevic, B. Stosic, and T. Stosic, "Towards finding exact residual en- 
tropies of the Ising antiferromagnets" , Physica A 157 (1989) 899-906. 

[41] M. Minsky, Computation: Finite and Infinite Machines. Prentice-Hall, 
1967. 

[42] B. Monien, "Transformational methods and their application to complexity 
problems." Acta Informatica 6 (1976) 95-108; and Corrigenda, 8 (1977) 
383-384. 

[43] C. Moore, "Unpredictability and undecidability in dynamical systems." 
Phys. Rev. Lett. 64 (1990) 2354-2357. and Nonlineamty 4 (1991) 199-230. 

[44] M.G. Nordahl, "Formal languages and finite cellular automata." Complex 
Systems 3 (1989) 63-78. 

[45] A. Nakamura, "Three-dimensional connected pictures are not recognizable 
by finite-state acceptors." Information Sciences 66 (1992) 225-234. 

[46] G.Y. Onoda, P.J. Stcinhardt, D.P. DiVincenzo, and J.E.S. Socolar, "Grow- 
ing perfect quasicrystals." Physical Review Letters 60 (1988) 2653-2656. 

[47] N. Packard and S. Wolfram, "Two-dimensional cellular automata." Journal 
of Statistical Physics 38 (1985) 901-946. 

[48] C.H. Papadimitriou, Computational Complexity. Addison- Wesley, 1994. 

[49] R.J. Parikh, "On context-free languages." Journal of the ACM A (1966) 
570-581 

[50] R.M. Robinson, "Undecidability and nonperiodicity of tilings of the plane." 
Inventiones Math. 12 (1971) 177-. 

[51] A. Rosenfcld, Picture Languages: Formal Models for Picture Recognition. 
Academic Press, 1979. 



39 



[52] A. Salomaa and M. Soittola, Automata- Theoretic Aspects of Formal Power 
Series. Springer- Verlag, New York, 1978. 

[53] A.G. Schlijpcr, "Tiling problems and undecidability in the cluster variation 
method." Journal of Statistical Physics 50 (1988) 689-714. 

[54] J.D. Shore, M. Holzcr, and J. P. Sethna, "Logarithmically slow domain 
growth in nonrandomly frustrated systems: Ising models with competing 
interactions." Physical Review B 46 (1992) 376-404. 

[55] M. Sipscr, "Halting space-bounded computations." Theoretical Computer 
Science 10 (1980) 335-338. 

[56] A. Szepietowski, "Two-dimensional on-line tesselation acceptors are not 
closed under complement", Information Sciences, 64 (1992) 115-120. 

[57] H. Wang, "Proving theorems by pattern recognition II." Bell System Tech. 
J. 40 (1961) 1-42. 

[58] B. Weiss, "Subshifts of finite type and sofic systems." Monatsh. Math. 77 
(1973) 462. 

[59] S. Willson, "On the ergodic theory of cellular automata." Mathematical 
Systems Theory 9 (1975) 132-141. 

[60] S. Wolfram, "Computation theory of cellular automata." Communications 
in Mathematical Physics 96 (1984) 15-57. 



40 



